Are there arbitrarily long arithmetic progressions in 
the sequence of twin primes? II 



Janos Pintz* 



1 Introduction 

Until very recently the twin prime conjecture seemed to be completely inac- 
cessible with available methods of number theory. Four years ago, in a joint 
work with D. Goldston and C. Yildirim [GPY] we proved that assuming a 
very regular distribution of primes in arithmetic progressions we obtain a 
somewhat weaker result 

(1.1) liminf(p n+ i - p n ) < 16, 

n— >oo 

where p n denotes the n th prime. 

The condition was that the level $ of distribution of primes, that is, an 
exponent, such that for e > 0, A > 



(1-2) E 



q<N#-£ (a,q)=l p = a (mod q) 
p<N 



max 

a 



N 



(log Ny 



satisfies i? > 0.971; an assumption, just slightly weaker than the strongest 
possible hypothesis i? = 1, the well-known Elliott-Halberstam [EH] conjec- 
ture. The best known admissible value for the relation •& = 1/2, is the 
celebrated Bombieri- Vinogradov theorem. In the same work we proved that 
any i? > 1/2 would yield infinitely many bounded gaps between primes, that 
is 

(1.3) hminfO n+1 -p n )<C(tf). 

n— >oo 

Since we can not suppose concerning the level of distribution anything 
beyond the Elliott-Halberstam conjecture (which would yield also (jl.ip ) the 
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question arises, whether we can deduce the twin prime conjecture itself - or 
perhaps even a positive answer for the question in the title of our paper - 
from a hypothetical very regular behaviour of some related sequences (pos- 
sibly including the primes themselves) similarly to the Elliott-Halberstam 
conjecture for primes. 

We will give under some plausible hypotheses an affirmative answer for 
this question, including the existence of arbitrarily long arithmetic progres- 
sions in the sequence of twin primes. Surprisingly the required distribu- 
tion level is just # > | (or with more precise arguments even slightly less, 
3 > 0.7284), but the sequences for which a suitable analogue of (|1 .3j) is 
needed, are not just the primes but all the following ones: 

(1.4) logp, A(n), A(n)A(n + 2), A(p + 2)logp, A(p-2)logp, 

where p denotes always primes, V the set of all primes. 

We have to note that while for A(n) we know the analogue of the Bombieri- 
Vinogradov theorem 



(1.5) max ^( n ) 



n=a (mod q) 
<N 



N 



log^iV 



with = 1/2 (the proof of Vaughan [Vau], Theorem 4, with /u(n) in place of 
A(ra) can be easily modified to yield (|1.5p ). our knowledge about the other 
sequences is much more limited, since the following problems are still open 
(see [Cho], [Hil], [Iwa]). 

Problem 1. Is £ A(n)A(n + 2) = o(x) (see [Cho, (341), p. 96; the quantity 

n<x 

O(X) there is a misprint, it has to be replaced by o(x)]), or even whether 
we have an absolute constant c such that 



(1.6) 



A(n)A(n + 2) 



n<x 



< (1 — c)x for x > xq. 



Problem 2. Are there infinitely many primes with X(p + 2) = —1 (or 
A(p-2) = -l)7 

Problem 2'. Are there infinitely many primes with \{p + 2) = 1 (or 

A(p-2) = 1)? 

It may be worth to mention that the author succeeded to show very 
recently [Pin2] the existence of a positive even d < 18 such that \{p+d) = —1 
for infinitely many primes p. 
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We can more generally work with any fixed positive even integer h in 
place of 2, so the same argument works for the generalized twin prime con- 
jecture too. 

Theorem 1. Suppose that with a $ = i?i > 3/4, the relations (|1.2|) . (|1.5j) . 

further the analogues of (|1.5p with X(n) replaced by X(n)X(n + h), X(p — 
h)logp and X(p + h)logp hold, where h is any positive even integer. Then 
p + h is prime for infinitely many primes p. 

The result can be proved with somewhat more effort under a slightly 
weaker condition for the distribution of the above sequences (i? > 0.7284). 
Further we can give a lower estimate for the number of generalized twin 
primes up to N which is just a constant factor weaker than the expected 
number 

(1 . 7) So(fc) _z_, e „ W :=n(i-£fn(i-(^)- 

Theorem 2. Suppose that the conditions of Theorem^ are valid for a •& > 
0.7284. Then with an absolute constant c we have for any even h > 

(1.8) #{P<N; p, P + heV}> C< f fl N forN>N 1 . 

log TV 

The estimate (|1.8p implies that as shown in [Zho] , or in greater generality 
in [Pinl], the method of proof of Green-Tao [GT] can be adapted to this 
situation, yielding 

Theorem 3. Suppose the conditions of Theorem [7] for a i?i > 0.7231, that 
is, that all 5 functions in (jl.4p have distribution level $i, with 2 replaced by 
h. Then for any even h > there are arbitrarily long arithmetic progressions 
such that p + h is also prime for all elements of the progression. 



2 Proof of Theorem [T] 

In the work [GPY] we introduced for /c-element sets % = {/ij}f =1 the function 
(2.1) 

1 / R\ k+l k 

A R ( n] n,l) := -——^ ^Hlog-r) , Pn(n) = l\(n + h i ), 

^ >' d<R ^ ' i=l 
d\P n (n) 
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which we will use now in the special case k = 2, % = {0, h}. However, 
instead of a n = A|,(n) as in [GPY] we will weight now the integers with 

(2.2) b n = a n (l-\(n))(l-\(n + h)) > 0, a n = A R {n; H, I) 2 . 
The singular series 

(2.3) e( «) = n(i-^)(i-i)"'. 

where f p (%) denotes the number of residue classes occupied by %mod p, 
reduces now to &o(h), given in fj 1 . T|) . The /c-tuple % is called in general 
admissible if v p = v v (Ji) < p for all primes p, equivalently, if 6("H) ^ 0. 
We remark that for any admissible /c-tuple % = T~Lk, hence also in our case 
H = {0, h} we have 

(2.4) em k ) > n l n (i - ~) (i - 1) k > <*(*)■ 

p<2k F p>k V F/ V F/ 

We quote from [GPY] as our first two lemmas Propositions 1 and 2 (see 
(|2.14p - (l2.15p ). which will form the base of our argument. We will restrict 
ourselves for the case H = %i = %2i but keep the parameter /, which will be 
used in Section [3] to show the stronger Theorem [2l We will use the notation 
9(n) = logp if n = p € V, 9{n) = otherwise, n ~ N for n E [N, 2N), C an 
absolute constant whose value may be different at different occurrences. 

Lemma 1. // R < iV^log N)~ c then 
(2-5) 

I Y, A«(n; n, h)^n M ) = (e(«) + o( i)) I h ) % g *fJlZ - 

Lemma 2. // R < ]\[(®~ e )/ 2 then for any h £ H we have 

( 2 - 6 ) 4 E Ar ^ n ' ? i) A ^( n ; + h) = 

fKM-L. n^f^+h + 2\ (logfi) fc +^+^+ 1 
= (6(?£)+o(l))^ + 1 j (fc + , i + , 2 + 1)r 

We will need an analogous lemma for the sequences 
(2.7) /(n) = A(n), A(ra)A(ra + /i), 9(n)X(n + h), \(n)8(n + h), 
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where we use the hypothesis that f(n) satisfies the analogue of (jl.5p . that 



is. 



(2i 



E ™f x E f( n ^ 



q<N®~ 



n=a (mod q) 
n<N 



<-s,A 



N 



loe A N 



Lemma 3. Suppose (|2,8p and f(n) <C (logA^) c . If A > arbitrary, R <C 
N^^ 6 ^ 2 , then we have for any % = {/ij}i=i 



(2.9) S f {N) = ^ E A ^ n ' H ' h)A R (n; H, h)f{n) « 



N 



Ior a N' 



where the constant implied by the <C symbol depends on k,li,C,A,e. 

Proof. For any squarefree m and % = {hi}f =l the number v m = v m {j€) of 
the solution of the congruence 



(2.10) 



W_(n + hi) = (mod m) 



i=l 



satisfies by the Chinese remainder theorem 

(2.11) y m = \[u p <k^ =d k (m), 

where oj{m) denotes the number of prime factors of m, d/.(m) the number 
of ways to write m as a product of k integers. Interchanging the order of 
summation we can write Sf(N) with the notation K = 2k + l\ + I2 as 



(2.12) 



n E E 

d<Re<R 



log^i? 



N 



/x(d)Me) (logjT^ (log f 
(k + h)l{k + 

E ( E l)"*E N (q), 



E /w 



[d,e]\P H (n) 



q<R2 s q=[d,e] 



where (for q < N) 

(2.13) E N (q) : = max 



E /w 



n~7V 
n=a (mod g) 



iV(log iV) 



c 
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Using our hypotheses we obtain as in (9.13) of [GPY] 
(2.14) 

log A 'i?/v- d 3k (q) 2 - , A 1/2 

S f (N) « Yl ^f 1 - E 1 E N(l) « 

\<R 2 q<R 2 

« ( Q ogN )^ N()j0gN) cN_ V 2 <<; (logAr) ^+(9^+C-A)/2_ 

-W \ log iV / 

□ 

Using the notation 

(2.15) B :=Bo(R,H,kJ)=^y^^e(H), 

we have by Lemmas [T] and [3] in the special case 7~L\ = %2 = {0, h}, k = 2, 
l x = l 2 = I = 

(2.16) B := b n ~ a n ~ Bq := . 

n~JV n~JV 

On the other hand we obtain from Lemmas [2] and [3] with the same choice 

n = {o,h}, i l = i 2 = o 

(2.17) 

P* -=Y1 b n (0(n) + 9(n + h)) = 

= 2 ^ a n {(l - \{n + h))9(n) + (1 - \(n))9(n + h)} ~4-2 J B -^^. 

3 

In order to have at least one prime pair p,p + h with p £ [iV, 2iV) we 
need to show with R = iV^)/ 2 

(2.18) P* - Blog(3N) > 0, 
which is really true if 

80-e 

(2.19) 3^ >1 + £ - 

This is trivially true for any fixed 1? > 3/4 if e is sufficiently small and iV 
sufficiently large. This proves Theorem [TJ 
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3 Proof of Theorem [2] 

The proof of Theorem [2] needs a relatively simple modification, which allows 
to weaken slightly the constraint i? > 3/4. This can be achieved - similarly 
to Section 3 of [GPY] - by applying a linear combination of the weights 
A#(n;"H, I) with I = and I = 1. More precisely we define 

(3.1) a' n :=a' n (n;u) = ^A R (n-,n,0) + ^^A R (n-,n,l)^ , 

where u is a real parameter to be chosen optimally later. In our case k = 2 
we obtain with the notation Bq in (|2.16p for % = {0, h} from Lemmas [1] and 
[2] in this case with the analogue b' n = a' n (l — A(n))(l — X(n + h)): 

(3.2) B'(N,H,u) :=J2 b n~J2 a n~ B °( 1 + 2u+ 2 ^ ' I ) ■ 

The analogue of the evaluation of (|2.17p is now 

(3.3) P' : = b' n (6(n) + 9(n + h)) ~ 

~2j fl ! l ((l(fl) + fl(n + / l ))~ 

JT , , r, (1 6n 18u 2 
~ 4B log R - H 1 

This means that we have to assure 

(3.4) P'-B'log(3N) > 

which will hold if we can find a u with 

4 „ 9u 2 \ / „ 3m 2 



(3.5) fcW=tf ._ + 3u + _>_M + a» + —>>0 

if we choose e sufficiently small. The optimal choice for u is u = uq = 
(\/34 — 2) /9, which yields a fixed positive lower bound cq for = g uo (i?) 
if 

(3.6) 1? > i?i = 0.7231. 

This is enough to obtain a weighted estimate for the number of generalized 
twin primes in [N, 2N) 

(3.7) 1 J] < log(3Af) > Cl 6o(/ l )log 3 fl. 

n,n+heV 
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However, if n and n + h are both primes then for % = {0, h} clearly 

(3.8) K R {n-H,l) = —L-(logi?) fc +' = ^A_(log i?) 2 +', 
consequently 

(3.9) a n (H,u )= (^-^Ylog 4 ^ 



2 

which by (|3.6p and (|3.7p leads to the estimate 

L J log R log iV log iV 

Remark. If we are allowed to choose a bigger then the lower estimate 
(|3.10p will improve but we do not reach the expected number corresponding 
to C3 = 1 even supposing ■d = 1, the Elliott-Halberstam conjecture. 
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